Spectroscopic confirmation of a galaxy at redshift z=8.6 

M. D. Lehnert 1 , N. P. H. Nesvadba 2 , J.-G. Cuby 3 , A. M. Swinbank 4 , 
S. Morris 5 , B. Clement 3 , C. J. Evans 6 , M. N. Bremer 7 , S. Basa 3 

1 GEPI, Observatoire de Paris, CNRS, Universite Paris Diderot, 5 place Jules Janssen, 92190 
Meudon, France 

2 Institut d'Astrophysique Spatiale, UMR 8617, CNRS, Universite Paris-Sud, Batiment 121, 
F-91405 Orsay Cedex, France 

3 Laboratoire d'Astrophysique de Marseille, OAMP, Universit Aix-Marseille & CNRS 38 rue 
Frederic Joliot Curie, 13388 Marseille Cedex 13, France 

institute for Computational Cosmology, Department of Physics, Durham University, South 
Road, Durham DH1 3LE 

5 Department of Physics, University of Durham, South Road, Durham DH1 3AJ 

6 UK Astronomy Technology Centre, Royal Observatory, Blackford Hill, Edinburgh EH9 3HJ 

7 Department of Physics, H H Wills Physics Laboratory, Tyndall Avenue, Bristol, BS8 1TL, 

UK 



Galaxies had their most significant impact on the Universe when they assem- 
bled their first generations of stars. Energetic photons emitted by young, massive 
stars in primeval galaxies ionized the intergalactic medium surrounding their host 



galaxies, cleared sight-lines along which the light of the young galaxies could es- 
cape, and fundamentally altered the physical state of the intergalactic gas in 
the Universe continuously until the present day 1 ' 2 . Observations of the Cosmic 
Microwave Background 3 , and of galaxies and quasars at the highest redshifts 4 , 
suggest that the Universe was reionised through a complex process that was com- 
pleted about a billion years after the Big Bang, by redshift z^6. Detecting ionizing 
Ly-alpha photons from increasingly distant galaxies places important constraints 
on the timing, location and nature of the sources responsible for reionisation. 
Here we report the detection of Lya photons emitted less than 600 million years 
after the Big Bang. UDFy-38135539 5 is at a redshift z=8.5549±0.0002, which is 
greater than those of the previously known most distant objects, at z=8.2 6 7 and 
z=6.96 8 . We find that this single source is unlikely to provide enough photons to 
ionize the volume necessary for the emission line to escape, requiring a significant 
contribution from other, probably fainter galaxies nearby 9 . 

UDFy-38135539 was selected as a candidate z«8.6 galaxy from deep Wide Field Camera 
3 observations of the Hubble Ultra Deep field 5 . Its red Y105-J125 colour is one of the reddest 
in the parent sample of £=8-9 candidates, and, together with the sensitive upper limits in the 
optical through the Y 105 band, make it the most plausible z^8.6 galaxy 5,10 . To search for its 
Lya emission, we obtained sensitive near-infrared integral-field spectroscopic observations of 
UDFy-38135539 using the SINFONI spectrograph at the ESO Very Large Telescope, with an 
integration time on the source of 14.8 h in the near-infrared J-band (1.1-1.4/im). 
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Figure 1 Two representations of the spectrum of UDFy-38135539 showing its significance, a, The 

spectrum shows a faint emission line detected at 6a significance at a wavelength of 11,615.6A, corresponding 
to a redshift z=8.5549±0.0020 for Lya. The integrated spectrum was extracted from a square aperture of 5x5 
pixels, corresponding to 0.625"x0.625", which is approximately the size of the seeing disk. The measured line 
full width at half maximum is 9.2±1.2A, which is about la greater than the instrumental resolution. The line 
flux is (6.1±1.0)xlCT 18 erg cm~ 2 s -1 , detected at 6a significance. All of the line parameters (redshift, width, 
flux and significance) were estimated with a Monte Carlo simulation assuming a Gaussian line and randomly 
generated Gaussian noise similar to that estimated for the observed spectrum. We note that the absolute 
flux calibration may have a significant systematic uncertainty of up to 30-40%, but this does not affect the 
estimate of the significance of the line detection. The night sky spectrum, scaled arbitrarily, is shown in grey. 
Regions of particularly deviant values in the spectrum correspond to strong night sky lines. The emission line 
from the source lies fortuitously in a region relatively free of night sky contamination. We estimate that the 
percentage of regions in the night sky with a background as low as that near the detected line is approximately 
50% for 1.15-1.35/xm and is generally lower over the rest of the SINFONI J bandpass, b, The sky-subtracted 
two-dimensional spectrum shows the projection of the spectrum along the spectral and right-ascension axes 
of the data cube. It corresponds to a two-dimensional longslit spectrum obtained with a slit width of 0.625" 
positioned along right ascension on the sky. The object is indicated by the white circle, the regions affected 
by the night sky lines are labelled, and the range in the expected position of the source is marked. 

In our spectrum, we detect faint line emission at a wavelength of A=11615.6±2.4A, corre- 
sponding to a redshift z=8.5549±0.0002 assuming that the line is Lya (Fig. 1). This redshift 
is consistent with the redshift estimates made by comparing the photometry to models of spec- 
tral energy distributions 10 . We constructed a line image by spectrally summing a data volume 
containing the line (Fig. 2). Both the size of the emission and spectral width of the line are 
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what would be expected for a source of astrophysical origin. If the line were due to detector 



noise, this would generally lead to a line-width and source size that are smaller than the res- 
olution of the spectrograph and the smearing due to atmospheric turbulence (Supplementary 
Information). 

The photometry from the Hubble Space Telescope (HST) allows for an alternative (but 
unlikely) redshift of z=2.12, so we also investigate whether the emission line could be another 
emission line at lower redshift. In this case, the line may be the [On] A3, 726 and 3729 emission 
doublet, but we rule this out because the [Oil] doublet would be clearly resolved, and, hence, 
the line would be intrinsically wider than observed (details are given in the Supplementary 
Information). 
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Figure 2 Lyman-a line image of UDFy-38135539. The line image was constructed by summing the 
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region containing the emission in the wavelength direction. The inset shows the expected morphology of a 
point source with the same signal-to-noisc ratio in its centre as that of the source and the circle shows the size 
of the point spread function (PSF). The colour bar shows the significant relative to the root mean squared 
noise in the data set. The image has been smoothed by a Gaussian with the same width as the point spread 
function. The size of the line image is consistent with the expected size of an intrinsically unresolved source 
whose image is smeared out by the turbulence in the Earth's atmosphere and distortions induced by the 
telescope and instrument optics. 

For a standard A cold dark matter (ACDM) cosmology, the Hubble constant is 70 km s" 1 
Mpc -1 , the dark matter density is 0.3 and the dark energy density (cosmological constant) is 
0.7, the luminosity distance is d^=86.9 Gpc, and the total flux of Lja emission implies that 
the luminosity of the source of 5.5±1.0±1.8xl0 42 ergs _1 (1-sigma uncertainty and systematic 
uncertainty). In comparison, currently known Lja emitters over a wide range of redshifts 
(zw3-7) have typical luminosities of (3-10) x 10 42 ergs -1 without significant evolution 11 ' 12 . 
Thus UDFy-38 135539 can be considered a typical Lja emitting galaxy. 

At z=8.55, the observed H 160 -band window samples around 1,700A in the rest-frame ul- 
traviolet. The H-band magnitude implies a flux density of log(f i7Qo ^ (ergs -1 cm -2 Hz -1 )) = 
— 30.7±0.2 and an intrinsic luminosity density of log (L 17Q0 ^ (erg s -1 Hz~ 1 ))=28.3±0.2. The 
continuum luminosity density is about one magnitude fainter than M^y, the characteristic 
magnitude of the ultraviolet luminosity function, for galaxies with redshifts z=6-7 13 ' 14 . If the 
observed evolution of the luminosity function from z^3 to z^7 continues to z^8.6, this would 
imply that UDFy-38135539 is a typical M^ y galaxy 5 . 

Although we can not derive the characteristics of the stellar population in UDFy-38135539, 
similarly-selected galaxies at lower redshifts, z^5-7, seem to be young (ages between 10 and 
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100 million years, Myrs) and have both low metallicity and low extinction 15 17 . Plausible 
characteristics for the stellar population in UDFy-38135539 include low metallicity (10% of the 
solar metal abundance to essentially zero heavy elements), a mass distribution either similar 
to that of massive stars in local galaxies or with only very massive stars 18 , and ages between 10 
and 100 Myrs (but perhaps as old as 300 Myrs 19 ). Considering this range of mass distributions, 
metallicities and ages, we estimate the star-formation rate of UDFy-38135539, on the basis of 
the UV continuum luminosity, to be 2-4 Mq yr _1 . On the basis of the Lja luminosity, we 
estimate the star-formation rate to be 0.3-2.1 Mq yr _1 . However, we caution that, owing to 
the unknown absorption by the intergalactic or interstellar medium, the star-formation rate 
estimated using this Lyman-o; luminosity should be considered as a lower limit. 

Observing Lja emission in a galaxy at z=8.55 suggests that the surrounding inter-galactic 
medium must be ionized beyond ~1 Mpc from the source to allow the emission to escape. 
We note that the mean recombination time at z~8.6 is approximately a Hubble time at this 
redshift (~600Myr). Thus, once a region of the inter-galactic medium becomes ionized, we 
expect that it will be fossilised because the gas has insufficient time to recombine before the 
end of reionization. Moreover, because the time during which a source is a luminous emitter of 
ionizing photons is significantly less than a Hubble time, the sources that created any ionized 
bubble in the intergalactic medium may be very difficult to detect 20 . 

It is therefore instructive to investigate the possible size of the ionized region around 
UDFy-38135539. The total number of ionizing photons that UDFy-38135539 has produced 
allows us to estimate the size of the bubble it has ionized 9 . Adopting the star-formation rate 
derived from the ultraviolet continuum flux density, and assuming the range of character- 
istics as discussed above, we estimate that UDFy-38135539 will ionize a region of between 
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~0.1(f esc /0.1) 1 / 3 and 0.5(f esc /0.1) 1 / 3 Mpc in radius where i esc is the escape fraction of ionizing 
photons. With such a small radius, the neutral intergalactic medium surrounding the bubble 
will significantly suppress the intrinsic Lya emission from the source 21 ' 22 (Fig. 3). Outflows 
of gas driven by the star-formation within the source may help the escape of Lya radiation 23 . 
However, for the line emission to be highly redshifted relative to the systemic velocity of the 
sources, the optical depth to Lja must be high, further suppressing the emission. 

Given the difficulty of ionizing the surrounding medium sufficiently, the most likely expla- 
nation for the relatively strong Lya emission from UDFy-38 135539 is that other sources within 
a few megaparsecs of this source may also have contributed to ionizing this volume. Indeed, 
the likelihood that many sources contribute to the ionization of bubbles during reionisation 
has been discussed previously 20,24 . As the intergalactic medium is potentially 30-50% ionized 
at this redshift 25 , it would not be surprising that multiple ionizing sources exist in the vicinity 
of UDFy-38135539 that may also be responsible for ionizing a significant fraction of the entire 
volume of the intergalactic medium. 

Thus, the luminous Lya emission of UDFy-38135539 implies that it is probably surrounded 
by other sources that contributed to the reionization of the Universe, but which have not yet 
been discovered or characterized. With the current capabilities of the Wide Field Camera 3 
and sensitive spectrographs in the near-infrared on 8-10-m-class telescopes such as SINFONI 
(and, in the near future, the KMOS spectrometer, also at the Very Large Telescope), it may be 
possible directly to detect and characterize the galaxies during the epoch of reionization, help- 
ing us to understand how galaxies reionized the Universe. However, the biggest breakthroughs 
in our understanding of galaxies responsible for reionization will come with observations using 
the European Extremely Large Telescope and the James Webb Space Telescope over the next 
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decade. 




Figure 3 The predicted Lya flux for a given ultraviolet flux density. The Lya flux, fi, ya , is predicted 
assuming a range of characteristics for the stellar population within UDFy-38135539. The characteristics - age, 
metallicity and distribution of stellar masses - determine the relationship between the non-ionizing ultaviolet 
continuum at 1500A (with flux density f„) and the ionizing continuum with A <912A. We adopted a range of 
ages from 10 to 300 Myr for a metal poor stellar population given by a Scalo initial mass function 26 . Our other 
initial mass function is the one that only contains massive stars, > 100 Mq, which have zero metallicity 18 . For 
this top heavy initial mass function, we only considered ages of 10 and 100 Myrs because it is unrealistic for 
metal-free star-formation to persist for after the first supernova explosions which are expected after a few to a 
few tens of megayears. For all the calculations, we have assumed an escape fraction of ionising photons of 10%. 
Estimates of the escape fraction in the local Universe up to about z^3.3 suggest modest fractions of ~10% or 
less 27-29 . The black circle represents the UV continuum flux density and Lya flux of UDFy-38135539. The 
uncertainties in this data are shown in the inset (the 1-a random uncertainty and the systematic uncertainty 
added in quadrature). The uncertainty in the Lya flux is dominated by the systematic uncertainties which 
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are included in the error bars in the inset. Under our assumptions, the Lya flux of UDFy-38135539 is greater 
than that expected if it alone was responsible from ionising its local volume. Because the recombination time 
is long at z«8.6, ^600 Myrs, many of the sources responsible for ionising this region could have easily faded 
or may simply be of lower luminosity. 
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Supplementary Information 

The emission line presented here could have several plausible origins other than Lyman-a, in- 
cluding non-random detector noise, night-sky residuals, or another astrophysical emission line at 
lower redshift. Here we discuss the reduction methods, and provide several tests of the detection, to 
demonstrate why it is Lyman-alpha emission. 

SI Observations and Data Reduction 

We used SINFONI 31 on UT4 of the VLT to obtain deep, 3-dimensional spectroscopy of UDFy- 
38135539. We observed UDFy-38135539 in the J band, which covers the expected wavelength of 
Lya at a redshift z~8.5, the redshift determined from the multi-wavelength photometry available for 
this source 32 . We observed for 16 hrs, granted as Director's Discretionary Time (program ID 283. A- 
5058), and obtained a total of 53400 seconds of integration time on the source. Individual exposure 
times were 600s, and we adopted a dither pattern where the source remained in the field of view for 
all exposures. Observing faint, high- redshift galaxies requires a high accuracy in the pointing, better 
than 1". We therefore defined our pointing by acquiring a star at about 1.5' from the position of 
UDFy-38135539 32 at the be ginning of each Observing Block (a sequence of 5-6 individual exposures 
with a total length of 1 hr) . At the end of each Observing Block we observed a nearby star of known 
J-band magnitude and spectral type at a similar airmass. Flux scales were obtained from these stars, 
and we also used the size of the point spread function of these stars to monitor the seeing. 

An additional uncertainty can arise in the calibration due to both the variation in the atmospheric 
absorption at the wavelength of the line and to the crudeness of the flux calibration. The depth of 
absorption in the spectral region near the line is significant, «10-30%, and is variable depending 
on the amount of water vapour in the atmosphere and the telescope elevation during which the 
observations were taken. The data and standard stars are taken at somewhat different times and 
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air masses, adding additional uncertainty. Spectre-photometric calibration in the near-infrared is not 
very accurate. There are no generally available spectrophotometric standards as in the optical and 
the calibration star is used to simultaneously provide flux calibration and to remove the effects of 
atmospheric absorption. The crudeness of the calibration probably induces an additional uncertainty 
of 20-30% and changes in the atmospheric absorption between the observations of the source and 
calibration star can introduce systematic uncertaintes of about the same order. This additional 
uncertainty does not influence the significance of the detection, as the measurement of the noise and 
the line flux are affected by the same systematic uncertainties, but it does influence the accuracy of 
the estimate of the line flux. Systematic uncertainties in the absolute flux measurement are therefore 
of-order 30-40%. 

The instrumental resolution was measured from the widths of night sky lines extracted from a 
SINFONI cube near the wavelength of the line, and the error estimates are based on a Monte-Carlo 
simulation of the data with a Gaussian line profile and the noise characteristics of the region near the 
line. To determine the instrumental resolution, we used a data cube obtained in exactly the same 
manner as the science data with the only difference being that we did not subtract the night sky. 
The spectral resolution was measured to be 190 km s _1 or R=A/AA= 1580 at A ~1.16 / um. 

The data were reduced using our own data reduction software based on IRAF 33 . These routines 
have been tested and used extensively and have been discussed extensively in previous articles 34 ' 35 ' 36 ' 37 
A study similar to this one has been attempted previously 38 . 

S2 The Nature of the Line 

An obvious concern in searching for faint line emission from galaxies at the highest redshifts is that 
the detection is spurious. This is particularly worrisome for near infrared observations. Near-infrared 
detectors have pixels that do not behave linearly with exposure and/or have a significantly different 
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response from the vast majority of pixels. In addition, the line and continuum emission from the 
sky emission has structure and is also variable. These effects result in reduced data that do not have 
strictly Gaussian random noise. These effects can be largely, but not entirely corrected for during 
the data reduction. Because of this, it is important to make a number of comparisons between the 
properties of any putative emission line and those expected for a real signal. 

To show that the line is not an instrumental artefact, we have made several tests that an astro- 
physical line must meet at a minimum to be considered real. 

S2.1 Characteristics of the Line 

The line in the integrated spectrum has a width that is slightly larger than the instrumental reso- 
lution. The characteristics of the line detected in our cube are given in Table 1. While we would 
expect the Lya profile to be intrinsically asymmetric due to the absorption on the blue side of the 
profile due to neutral Hydrogen, the observed line is not resolved and thus has a Gaussian profile. 
Observations of Lyman-a emitting galaxies at redshifts around 5 also show symmetric profiles when 
they are approximately unresolved, even with much higher signal-to- noise than 6 39 . The faintness of 
the signal and low signal-to-noise makes it impossible to detect the faint, and possibly non-Gaussian 
wings often observed in Lya emitting galaxies. 

The extent of the line image is 0.6" x 0.7", similar to the size of the seeing disk which results 
from atmospheric turbulence. The WFC3 image 32 shows that the source has an intrinsic size in the 
continuum of 0.3". If the emission line region of the source is not larger than the continuum source 
by about a factor of 2, it would be unresolved in our data as observed. 

The line is found in a region of the night sky that is relatively free of bright night-sky lines. 
Night-sky line residuals, due to mechanical instabilities of the instrument leading to small (fractions 
of a pixel) wavelengths shifts, can lead to features that may be mistaken for real emission lines. The 
brightest of these spurious features arise in a part of the data cube that is near the edge of the slitlets 
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ID 


^obs 


redshift 


FWHM 


Flux 


Lya 


11615.6±2.4 


8.5549±0.0002 


9.2±1.2 


6.1±1.0 



Table 1: Emission-line properties measured from our SINFONI data of UDFy-38135539. Col- 
umn (1) - Line ID. Column (2) - observed wavelength. Column (3) - Redshift assuming the 
line is Lya at 1215. 67A. Column (4) - Measured full width at half maximum (FWHM) in 
units of A. The line is unresolved. Column (5) - Line flux in units of 10 -18 erg cm -2 s _1 . The 
uncertainty in the line flux is derived from repeated realisation of the line given the noise in 
the data set. There is a systematic uncertainty of the flux of 30-40% which is not included in 
the estimate given. 

of the slicer. In Fig. 1 of the main article, the night sky line residuals are more prominent near the 
periphery of the spectrum. The candidate Lyman alpha emitter is in a region of the cube that is 
unaffected by the edges of the slitlets. 

The line emission is within 0.4" of the expected position of the source in our data cube, and 
within about one half of the size of the point spread function. Due to the low signal-to-noise of the 
detection itself, uncertainties in the relative astrometric position of the offset star and target, and 
inaccuracies in the telescope offset, imply that the offset we observe in our cube is consistent with 
the position of the source. 

S2.2 Random or Correlated Noise? 

All of these suggest at a minimum that the emission line is consistent with being of astrophysical 
origin from the targeted source. Even though it appears that this is likely to be an emission line 
from a source, it could also be that this is a rare noise spike in the data that coincidently happens 
to have characteristics expected for an astrophysical source. We will now illustrate that the line 
with the properties we have estimated is also rare, rare enough not to be associable with random or 
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correlated detector noise. 

In our three-dimensional data cube we estimate that there are approximately 500,000 pixels 
where we expect night-sky line residuals that are similar to those in the area covered by the source 
(and which are contained in all exposures). We used a simple Monte Carlo approach to investigate 
how many of these pixels could produce a spurious noise signal similar to the actual signal that we 
have observed. 

We randomly selected a spatial position covered by the data cube and extracted the 25 one- 
dimensional spectra centred on that position (in a 5 by 5 spatial pixel region, specifically chosen to 
be approximately the size of one seeing disk). We then randomly selected a wavelength within the 
region covered by those spectra and, for each spectrum, fitted a Gaussian line profile centred at that 
wavelength. The width of the fitted Gaussian was constrained to be between the spectral resolution 
and 400 km s _1 and its total flux was allowed to vary between 3 and 90 times the noise level. We 
then determined how many of the 25 spectra had acceptable fits comparable to the detected feature. 

This procedure was repeated 25000 times and on 301 occasions (1.2%) resulted in reasonable 
fits for one or more of the 25 extracted spectra (see Fig. [I]). The maximum number of spectra with 
acceptable fits in any one trial was eleven, found in a single trial. There was not a single instance of 
all 25 spectra resulting in acceptable fits, as there was for the detected feature. Given the statistics 
shown in Figure 1 , the probability of generating such a feature purely from the noise characteristics 
of the data cube is at best one in 25000 (a confidence level of > 99.99%), and likely considerably 
lower. 

S2.3 A Spurious Line: Negative Line Image 

There are several other ways to argue that the line is not likely to be spurious. We searched for 
the negative images of the line which are a by-product of our observing strategy, and allow for a 
particularly powerful test of the astrophysical origin of our signal. We adopted a dither strategy 
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where the source is within the field of view at all times, but falls into different parts of the detector 
in subsequent individual exposures. This means that we can use one object frame to subtract the 
sky from another, and the presence of the source in each of these frames implies that we produce a 
characteristic negative signal in each individual sky-subtracted frame, where we subtract the source 
from an empty part of the sky. We will refer to this negative signal as the "shadow" . Identifying the 
shadow is a particularly powerful test, since the shadow arises from a different part of the detector, 
sharing only half the pixels with the source in each individual exposure. From our experience with 
integral-field spectroscopy with this instrument of over a 100 faint galaxies, we are not aware of any 
data cube where a shadow would have been produced by an instrumental artefact. 

Since the shadow does not fall in the same region of the detector for all dithers, and because 
the line is so faint, we reconstructed a dedicated "shadow cube" from the individual frames. We 
calculate the expected position of the shadow in each individual exposure from the measured target 
position. We then shift the individual cubes such that all shadows fall on the same position and 
combine the individual exposures to obtain a shadow cube, where the residual of the target falls at 
the same position as the galaxy in the original combined cube (Fig. [2]). The shadow line is clearly 
seen at the wavelength of the detected line (Gaussian line profile above the spectrum), and with a 
similar width and flux. 

We also evaluated the individual frames to infer whether the candidate emission line may be 
dominated by non-Gaussian noise originating from individual frames. We followed two approaches. 
We constructed two cubes each containing half of the frames. For Gaussian noise we expect to find a 
signal-to-noise ratio lower by y/2 in each subset, giving SNR~4.6<r. We extracted integrated spectra 
at the same position and with the same apertures as for the full data set, and measured the signal-to- 
noise ratio of each line. We find signal-to-noise ratios of 4.3a and 3.5<r for each subset, respectively. 
Given that the faintness of the detection alone makes each estimate accurate only within ~25% this 
is fully consistent with what is expected for half of the full data set. 
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To further evaluate whether the signal may be dominated by strongly deviant pixels in individual 
frames, we also extracted line images for each individual, fully reduced data cube, by collapsing over 
the wavelength range near the wavelength of our line detection. In total we cover a range of 30A 
(3x the full width at half maximum (FWHM) of the line) to be conservative. (By conservative, we 
mean that we took a spectral range much larger than necessary to extract the signal and therefore 
will possibly include additional sources of non-random noise). We then inspected each line image 
individually, in order to identify frames which were affected by bad pixels and/or night-sky line 
residuals at the position of the source. We found that 20 of the frames could be contaminated. 
However, since the line image covers a range of 30 A, which includes a faint night sky line residual 
slightly red-ward of the source, this does not imply necessarily that the line itself is affected. We 
combined all 20 of the contaminated frames (which had non-Gaussian noise near the position of the 
source) and also the remaining 69 frames that were not so affected (i.e., where the noise was better 
behaved and approximately Gaussian). We found a strong, narrow (FWHM~1.3 spectral pixels) 
spike in the cube constructed using the frames that were contaminated, but at the wavelength of 
a night-sky line, and recovered a faint line consistent with that found in the full data set in the 
combined cube of individual frames that were not contaminated. Within the larger uncertainties, 
this cube of uncontaminated frames also produced a line with a flux consistent with that found when 
using the full data set. We thus conclude that the frames with strongly non-Gaussian noise do not 
have a significant impact on our line measurement in the full cube. 

S3 Other Possible Sources of Line Emission 

Having shown that instrumental effects are unlikely to cause the observed emission line, could it be a 
line due to a source at lower redshift that is not Lyman-a? The very red colour and lack of detection 
in the optical (B435<29.8, V606<30.2, i775<29.8, zg5o<29.1 all 2-a limits) 40 makes it highly unlikely 
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that UDFy-38135539 could be an emission line galaxy at lower redshift > . The best fit redshift for 
the photometry is z=8.45 with an acceptable range of z=7.75-8.85. There is also a shallow (reduced 
Xv = 3) secondary minimum in the probability distribution at z=1.60-2.15, which could be caused 
by a large spectral break at 4000A in the rest-frame of the galaxy 40 . 

The much fainter and more stable backgrounds of the WFC3 on board the Hubble Space Telescope 
compared to ground-based near-infrared imagers make the photometry particularly robust. However, 
additional systematic uncertainties are possible, for example line contamination in the filter bandpass 
used to discover the source. Our measured line flux, taken at face value, would lower the intrinsic 
J125 magnitude from J=28.41±0.24 to Jf2s r =29.7±0.3, implying a bluer intrinsic continuum J-Y 
colour, Yio5-Ji25 r >l-0±0.3 (l-cr), and a lower redshift is possible. This illustrates the importance of 
emission-line measurements for candidate Lyman-break galaxies at the highest redshifts, and makes 
it worthwhile to specifically test all alternative redshifts that may be implied by the line detection. 

The only other plausible line identifications at lower redshifts are the generally bright lines from 
galaxies of Ha at 656. 3nm, which would imply a redshift of z=0.77, [OIII] at 500. 7nm, which would 
imply a redshift of z=1.32, and the [Oil] doublet at 372.6 and 372.9 nm, which would imply a 
redshift of z=2.12. Given the secondary minimum in the photometric redshift distribution around 
z~2 40 , we must make detailed tests of the possibility that the detected line could be [Oil] at z=2.12. 
For illustration purposes only, we have added an artificial doublet line profile to our data set after 
subtracting off our best fit profile from the line (Fig. [3]). The figure shows that any [Oil] doublet 
emission should be both resolved and wider than the current best fit single line. To go beyond 
this illustration and to show statistically that the [Oil] doublet is inconsistent with the properties 
of the line, we carried out a simple Monte-Carlo simulation. For this we generated 10 6 artificial 
data sets with the noise characteristics of our data cube, and added two delta functions separated by 
2.8(l+zr0jn)A which were then convolved with two Gaussian profiles with the resolution of our data. 
We assumed a line ratio R3726/3729 =!• I n a vas t majority of the cases (~99%) we see a resolved 
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doublet, and in all cases, a width of the doublet that is significantly greater than that of the detected 
line (with an average width of 15±1.2A). Similar results are found when testing a few other plausible 
ratios of the two [Oil] components. We can therefore rule out that the line is [OII]AA372.6,372.9 at 
z=2.12 at a level greater than 99.9% or 3-a because it produces a line that is clearly broader than 
that measured. 

From the photometry and the line flux, the observed equivalent width of the line is about 1900A. 
All hypotheses other than Lya at z=8.55 also imply implausibly high equivalent widths for optical 
lines of «600 to 1,100A, which are not observed in the integrated spectra of local galaxies or 
even the majority of the rare photometrically selected galaxies that have extremely high equivalent 
widths 42 . Even if we include a systematic uncertainty in the flux of several tens of percent, up to 
40%, this would still be highly implausible. We therefore conclude that the line is indeed most likely 
Ly-a at z=8.55. 

We finally conclude that the line detected is not spurious, is inconsistent with an instrumental 
artefact, and is inconsistent with being an emission line at lower redshift. 
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Figure 1: The "multiplicity" distribution. The multiplicity is the number of pixels that can 
adequately be fit with similar fit parameters within a box corresponding to the size of the seeing 
disk (5x5 pixels), surrounding a randomly chosen pixel in a region containing no signal from 
and have noise characteristics similar to that of the region around the detected line emitter. 
The detected line emitter has a multiplicity of 25, whereas in the noise has multiplicities <11. 
Having found a multiplicity of 11 in 1 trial of 25,000, we can rule out larger multiplicities due 
to noise at a confidence level >99.996%. 
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Figure 2: The integrated spectrum of the "shadow" in the data set. Our strategy for sub- 
tracting the sky and combining the data set produces a negative residual due to the presence 
of the source in all sky-subtracted frames. These shadow shares all of the exposures at the 
position of the source, but a completely different set of pixels in the "off" position compared 
to the positive image. The Gaussian profile marks the fit of the Lyman-a emission line and 
the negative residual is clearly visible. 
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Figure 3: An alternative identification for the line as arising from the doublet of singly ionised 
Oxygen. To illustrate that it is unlikely that the identified line is consistent with being 
[OII]AA372.6,372.9 at lower redshift we subtracted our best fit Gaussian profile for the data 
cube to remove the emission from the source. This produces a line-free spectrum with the 
noise characteristics for our data cube in the region of and near the wavelength of the line. 
At line centre, we added two delta functions separated by 2.8(l+Z[o//])A which have been 
convolved with a Gaussian profile. This profile has a width that is exactly the same as the 
spectral resolution of our data. For this illustration we assumed a line ratio R3726/3729 =1 and 
a redshift of 2.12. This redshift corresponds approximately to the secondary minimum in the 
redshift probability distribution derived from the WFC3 photometry 40 . The blue line shows 
the Gaussian fit to our data shifted along the ordinate for comparison. Clearly the doublet 
would be resolved spectrally and overall be broader than the detected emission line even at 
the low signal-to-noise of our detection. 
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